A New Data Format for Czech Morphological Analysis

Author

  • Pavel Smerk
Abstract

The paper presents a new data format for the computational morphology of Czech. The new format significantly reduces the redundancy found in existing formats. It is also much more linguistically interpretable and acceptable. The paper shows that there is no need to develop a computer-specific description of morphology, but that the traditional linguistic description suffices quite well.

Related resources

A New Czech Morphological Analyser ajka

This paper deals with the effective implementation of the new Czech morphological analyser ajka, which is based on the algorithmic description of Czech formal morphology. First, we present the two most important word-forming processes in Czech: inflection and derivation. A brief description of the data structures used for storing morphological information as well as a discussion of the efficien...

Towards Czech Morphological Guesser

This paper presents a morphological guesser for Czech based on data from the Czech morphological analyzer ajka [1]. The concept rests on the assumption that new words in a language (which are therefore unknown to the analyzer) behave quite regularly, and that a description of this regular behaviour can be extracted from the existing data of the morphological analyzer. The paper...

Fast Morphological Analysis of Czech

This paper presents a new Czech morphological analyser which takes advantage of Jan Daciuk’s algorithms for minimal deterministic acyclic finite state automata. The new analyser is six times faster than the current analyser ajka at analysis proper, i.e. returning the possible lemmata and tags for a given word form, and for some other related tasks the difference is even bigger.

Merging Data Resources for Inflectional and Derivational Morphology in Czech

The paper deals with merging two complementary resources of morphological data previously existing for Czech, namely the inflectional dictionary MorfFlex CZ and the recently developed lexical network DeriNet. The MorfFlex CZ dictionary has been used by a morphological analyzer capable of analyzing/generating several million Czech word forms according to the rules of Czech inflection. The DeriNe...

Structural, morphological and optical characterization of green synthesized ZnS nanoparticles using Azadirachta Indica (Neem) leaf extract

ZnS nanoparticles have been synthesized using various amounts of aqueous Azadirachta Indica (Neem) leaf extract as capping agent and stabilizer. The synthesized nanoparticles were studied by FTIR, powder X-ray diffraction (XRD), scanning electron microscopy (SEM), transmission electron microscopy (TEM), energy dispersive analysis of X-rays (EDAX) and UV-Visible spectroscopy. FTIR spect...

Publication date: 2010